ROOTS OF POLYNOMIALS OF DEGREES 3 AND 4 



SVANTE JANSON 

Abstract. We present the solutions of equations of degrees 3 and 4 
using Galois theory and some simple Fourier analysis for finite groups. 



1. Introduction 

The purpose of this note is to present the solutions of equations of degrees 
3 and 4 (a.k.a. cubic and quartic equations) in a way connected to Galois 
theory. This is, of course, not the historical path; the solutions were found 
by del Ferro, Tartaglia, Cardano (Cardan) and Ferrari in the 16th century, 
see Appendix |Bj about 300 years before Galois theory was created. But in 
retrospect, Galois theory yields the solutions rather easily. More precisely, 
we will see below that Galois theory, together with some simple Fourier 
analysis for (small) finite Abelian groups, suggests the crucial constructions 
in the solutions; however, all calculations are easily verified directly, and 
we do not really need any results from Galois theory (or Fourier analysis) 
for the solution. Nevertheless, we find it instructive to use Galois theory 
as much as possible in order to motivate the constructions. (See also [2ll . 
Section 8.8] for a similar, but not identical, treatment.) The Galois theory 
used here can be found in e.g. 0], 0], [§], (§] or 21]. 



The appendices contain comments on the history of the problem, other 
solutions, and (Appendix|A]) the complications that may arise when we work 
with real numbers instead of complex. 

Remark 1.1. In contrast, full use of Galois theory is needed to show the 
impossibility of similar formulas for solutions of equations of degree 5 or 
more. This will not be discussed here; see instead e.g. 0], 0], @], [a] or 21]. 



We let throughout K be a field with characteristic 0. (Actually, every- 
thing in this note is valid also for a field K of positive characteristic p ^ 2, 3. 
However, the cases when the characteristic is 2 or 3 are different since we 
divide by 2 and 3 in the formulas below; there are also problems with sepa- 
rability in these cases.) 

The roots of a polynomial in K[x] are, in general, not elements of K, so we 
will work in some unspecified extension of K. This extension could be the 
algebraic closure K of K or some other algebraically closed field containing 
K; in particular, if K = Q or another subfield of C (as the reader may 
assume for simplicity), we can work in C. 
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For simplicity we consider monic polynomials only. (It is trivial to reduce 
to this case by dividing by the leading coefficient.) 

2. Polynomials of degree 3 

Let f(x) = x 3 + bx 2 + cx + d, with b,c,d£ K, be a polynomial of degree 
3, and let a%, 02, «3 be its roots in some extension of K. Thus 

f{x) = x 3 + bx 2 + cx + d = (x — a\)(x — a.2){x — 03). (2-1) 

It is convenient to make the translation x = y — 6/3, converting f(x) into 

9(y) ■■= f(y-b/3)=y 3 +py + q (2.2) 

(without second degree term) for some p,q E K. (Such polynomials, without 
the second highest degree term, are called reduced or sometimes depressed.) 
Thus g has the roots fa, fa, fa with ft = 0{ + 6/3, so a, = ft — 6/3, i = 
1, . . . , 3. Hence, 

g(y) = y 3 + py + q = (y-Pi)(y-^)(y-^)- (2.3) 

Consequently, identifying coefficients, 

+ & = 0, (2.4) 

frft + ftft + /3 2 /?3 = P, (2.5) 

/01/32& = (2.6) 

Remark 2.1. Explicitly, 

p = C -|6 2 , (2.7) 

q = d-\bc+ ^6 3 . (2.8) 
The polynomials / and g have the same discriminant 

A := Dis(/) = Dis(g) := ]J (a, - a,) 2 = J[ (ft-/?,) 2 . (2.9) 

l<i<j<3 !<«<i<3 

Since A is a symmetric polynomial in 01,02,03 (or Pi, fa, P3), it can be 
written as a polynomial in the coefficients of / or g. A well-known calculation 
yields, see e.g. [141 ] . 

A = 6 2 c 2 - 4c 3 - 46 3 d + 18bcd - 27d 2 = -4p 3 - 27q 2 . (2.10) 

We also define the square root of A: 

5:= H {a i -a j )= \[ (ft -/?,•) = VK. (2.11) 

l<i<j<3 1<«<J<3 

Note that while A is independent of the ordering of the roots, the sign of 5 
may change if we permute 0.1,0.2,013. More precisely, the sign is preserved 
by an even permutation but is changed by an odd permutation. 

Let E = K(ai, 02, 03) = K(fa, fa, fa) be the splitting field of /, or g, over 
K, and let G := Gal(£^ : K) be the Galois group of the extension E D K. 
The elements of the Galois group G permute the roots 04 (or ft), and G 
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may be regarded as a subgroup of the symmetric group 53. As said above, 
a (5) = 5 if a £ G is an even permutation, while a(5) = —5 if a is odd. 
Since K C K{5) C. E, E is & Galois extension of K(5), and the Galois group 
G&1(E : K(5)) is the subgroup of G fixing 5: 

G&l(E : K{5)) = {a G G : <r(5) = 5} = {cr G G : a is even} = GnA 3 (2.12) 

(where ^3 is the alternating group consisting of all even permutations in S3), 
at least if 5 7^ 0, or equivalently A / 0, which is equivalent to / separable 
(i.e., / has no multiple roots in K). In particular, if / is irreducible, in which 
case G is transitive and thus G = S3 or A3, G&l(E : K(5)) = A3, which is the 
cyclic group C3. Moreover, in this case, G&l(E : K(5)) = A3 = C3 acts on 
the vectors (01,02,03) and ((3i, fa, fa) by cyclic permutations; equivalently, 
if we regard these vectors as functions on Z3 = C3, Gsl(E : K(S)) = C3 acts 
by translations in C3. This suggests using Fourier analysis, or equivalently 
group representation theory, for ^3 = C3. (For Fourier analysis on finite 
Abelian groups, see e.g. [3j; in this case, the Fourier transform is often 
called the discrete Fourier transform. The more complicated theory of group 
representations for general finite groups is treated by [2Cj|.) 



Remark 2.2. The method below was given by Lagrange [ItJ in 1770-1771 



as part of his study of equations of higher degree, see [J, Sections 8.3, 12.1 
and p. 14]. The method is thus some decades older than both Galois theory 
and Fourier analysis. In this context, the Fourier transforms u and v in 
(|2.17p - (|2.18p below (or rather 3u and 3v) are known as the Lagrange resol- 
vents for the equation. (They were also used earlier by Bezout and Euler 
@, p. 46], and at the same time by Vandermonde [22l |. but Lagrange made 
a profound use of them.) Lagrange and others used this method to study 
equations of arbitrary degree, see [J, Sections 8.3 and 12.1] and, for exam- 
ple, the solutions by Vandermonde and Malfatti of some quintic equations 
givm m @, Chapters 7-8]. This is an important forerunner of Galois the- 
ory. In retrospect, the Lagrangre resolvents can perhaps also be seen as the 
beginning of discrete Fourier analysis. 

We assume, for simplicity, that KCC, and we then define 

1 \/3 

a; := exp(2vri/3) = -- + 2L_i, (2.13) 

a primitive third root of unity. Note that 

uj 3 = 1, l + w + w 2 = 0, (2.14) 

which will be used repeatedly below without further comment. 

Remark 2.3. For a general field K, not necessarily contained in C, we can 
let uj be a primitive third root of unity in K. It is easily verified that the 
formulas below make sense, and are correct, in K, so the result holds in full 
generality. 
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We then define the Fourier transform of a vector (zi, Z2,z%) (regarded as 
a function on Z3 = C3) as (Si, 2b, £3)5 with 

% := l(z 1 +^ k - 1 h 2 +u J -^ k - 1 h 3 ), (2.15) 

and note the Fourier inversion formula, which in this case is easily verified 
directly, 

z k = z 1 + cj k ~ 1 Z2 + w 2 (*-%3. (2.16) 

Hence, if we define 

u := |(ai + Lu 2 a 2 + wa 3 ) = + u; 2 /3 2 + w/3 3 ), (2.17) 

v := i(ai + Lua 2 + w 2 a 3 ) = + w£ 2 + w 2 /3 3 ), (2.18) 

and note that 

-i& = i(ai+a 2 + a3), (2.19) 

= f (ft (2.20) 

we see that the Fourier transforms of the vectors (a±, a 2 , 03) and (/?i, /?2j A) 

are (—^b,u,v) and (0, u, t>), respectively. Consequently, the inversion for- 
mula (|2.16p yields 

ai = -\b + u + v, (2.21) 

a 2 = -\b + uju + co 2 v, (2.22) 

a 3 = -lb + uj 2 u + tov, (2.23) 

and, equivalently, 

(3i=u + v, (2.24) 

/? 2 = uj U + o; 2 <y, (2.25) 

/3 3 = u 2 u + wd. (2.26) 

To solve the equation f(x) = 0, it thus suffices to find u and v. 

The objects u and v are elements of the field E(uj), which is the split- 
ting field of / (or g) over K(u). It is thus a Galois extension of K(u), 
and also of the intermediate field K(5,uj). An element of the Galois group 
G&l(E(u) : K(u})) maps E into itself (because it fixes K and E is a normal 
extension of K), and thus its restriction to E is an element of Ga\(E : K). 
This defines a group homomorphism Gal(E(u) : K{uj)) —> Ga\(E : K), which 
is injective because E[oS) is generated by E and K(uj); thus we can regard 
Gal(E(u}) : K(u)) as a subgroup of Gal(£ : K). Similarly, G&\{E(u) : K(5, 00)) 
is a subgroup of Gal(E : K(S)). 

Let if := Gal(£(w) : if (<5, w)). Then il C Gal(£ : if (5)) C A 3 , so if il is 
not trivial, then H = A3 and H is generated by a cyclic permutation a with 
cr(ctk) = otk+i (with indices modulo 3). Then, by (12.17p - (|2.18|) . cr(u) = lou 
and cr(f) = co 2 v. Consequently, a(v?) = u 3 , and a(v 3 ) = t> 3 . This implies 
that u 3 and v s are fixed by the Galois group H, and thus u 3 and v 3 belong 
to the fixed field Fvx.e(u)(H) = K(5,u). We can find them as follows, using 
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x 3 — 1 = (x — 1) (x — ui) (x — w 2 ) and thus x 3 — y 3 = (a; — y)(x — ioy)(x — 0J 2 y): 
First, by (|2T24j^([23H| ) and flZEJ, 

u 3 + w 3 = (n + «)(« + + u 2 v) = Pxfofo = ~q- (2.27) 

Next, by (|2T7|l - d21gl> . 

m-u = i(w 2 -(j)(a 2 -a 3 ) (2.28) 

ti-w« = |(1 -w)(ai - a 3 ) (2.29) 

i(l -w 2 )(ai -a 2 ) (2.30) 



it — w 2 i> 



and thus, using (|2.1ip . 

u 3 — v 3 = (u — v)(u — ujv)(u — uPv) = («i — a 2 )(ai — a 3 )(a 2 — a 3 ) 

9 



\/3i r / A 
27' 



(2.31) 



Remark 2.4. The choice of square root in (|2.31|) is not important, since 
a change of sign of it is equivalent to an interchange of u and v, which 
just permutes the roots a 2 and a 3 (/3 2 and /3 3 ).) This reflects the fact that 
if the Galois group Gal(E(u) :K(to)) contains an odd permutation r, then 
t(u) = uj j v and t(v) = uj^u for some j = 0,1,2; thus t(u 3 ) = v 3 and 
t(v 3 ) = u 3 . 

We thus find, recalling (|2.10p . 



3 _ ~H + V~ A / 27 _ 1 , .h^__ Q , JfP\ 3 , fQ\ 2 



ui= ,, ^ -,- = _ i+v _ f = _ i+ ( , 32) 



We then find u and v by taking cube roots. In order to choose the right 
roots, we also compute, from ([2T7]) - (|2TIS]) and (|2T4V(l23jh 

«w = 5 (/?i 2 + /3 2 2 + /3 3 2 + (w + ^ 2 )(A/3 2 + /3i/3 3 + ftft)) 

= I {{Pi + h + /?s) 2 - 3(/3i/3 2 + ftft + (2.34) 

If we replace u by the alternative cube root uju or u 2 u, we thus have to 
replace v by a; 2 ?; or ojv, respectively, which by (I2.2ip ~ (|2.23p and (|2.24p ~ 
(|2.26p yields a cyclic permutation of the roots «i,a 2 ,a 3 or /3i,/3 2 ,/3 3 . We 
summarize: 

Theorem 2.5 (Cardano's formula). The roots of g(y) = y 3 + py + q are 

given by 

_ / „ frm* ttt"? _ / „ / s , „s 9 

(2.35) 
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where the two square roots are chosen to be the same, and the two cube roots 
are chosen such that their product is —p/3; this gives 3 choices for the cube 
roots, which gives the 3 roots of g(y) = 0. (In the exceptional casep = q = 0, 
the only root is counted thrice.) 

Equivalently, the roots of f(x) = x 3 + bx 2 + cx + d are given by 



(2.36) 

A 

2 V 108' 

with p and q given by (|2.7|) — ()2.8[) . and A given by ()2.10j) . 

Remark 2.6. This formula is known as Cardano's formula since it was first 
published by Cardano in Ars Magna although it is attributed by him to 
Scipione del Ferro, see Appendix [Bl 

Remark 2.7. The case p = q = is exceptional because then (and only 
then) u 3 = v 3 = 0. This is the trivial case when / and g are cubes (x + b/3) 3 
and y 3 and thus have triple roots —6/3 and 0, respectively. 

The case with a double (but not triple) root are handled correctly by 
Theorem 12.51 This is the case when A = (but not p = q = 0), and thus 
u 3 = v 3 (7^ 0). We can find a cube root u = v with uv = u 2 = —p/3, 
and then the other eligible pairs of cube roots are {uju,uj 2 u) and (w 2 «,wti), 
yielding the roots f3± = 2u, /?2 = /?3 = —u, and thus a\ = —6/3 + 2u, 
Ot2 = «3 = —6/3 — u. 

Similarly, there are no problems in the case when u 3 or v 3 is 0, but not 
both. This happens, by (|2~32j^(pl«| ) and (|234l) . when p = but q / 0. 
Choosing the square root such that v 3 = 0, we have u 3 = —q; the polynomial 
g{y) equals y 3 + q which has the three roots u, cou, uj 2 u. 

Remark 2.8. By (12371) and (j2T34l) . which implies u 3 v 3 = -p 3 /27, u 3 and 
v 3 are the roots of the quadratic resolvent 

r(x) := x 2 + qx - p 3 /27 e K[x}. (2.37) 

Note that the quadratic resolvent has discriminant, by (|2.31j) . 

Dis(r) := (u 3 - v 3 ) 2 = - A = _ 1 Dis(/). (2.38) 

3. Polynomials of degree 4 

Let f(x) = x 4 + bx 3 + cx 2 + dx + e, with 6, c,d,e 6 K, be a polynomial of 
degree 4, and let 01,02,03,04 be its roots in some extension of K. Thus 

f(x) = x 4 + bx 3 + cx 2 + dx + e = (x — a\){x — 02){x — «3)(x — 04). (3.1) 

It is convenient to make the translation x = y — 6/4, converting f(x) into 

9(y) ■= f(y - b/A) = y 4 +py 2 + qy + r (3.2) 
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(without third degree term) for some p,q,r G K. Thus g has the roots 
Pi, P2, 03: Pi with 0i = a.i + 6/4, so a* = $ - 6/4, i = 1, . . . ,4. Hence, 

<7(y) = y 4 + P y 2 + qy + r = (y - fa){y - foXv - fo)(v - P*)- (3.3) 
The polynomials / and g have the same discriminant 

A := Dis(/) = Dis(g) = J] (<* - atf = J] (A - Pjf. (3.4) 

l<i<j<4 i<i<j<4 

Since A is a symmetric polynomial in ct\, . . . , «4 (or Pi, . . . , P4), it can be 
written as a polynomial in the coefficients of / or g. A well-known calculation 
yields, see e.g. [141 ] . 

A = b 2 c 2 d 2 - Ab 2 c 3 e -4b 3 d 3 + 18 6 3 cde - 276 4 e 2 - 4c 3 d 2 
+ 16 c 4 e + 18 bed 3 - 80 bc 2 de - 6 b 2 d 2 e + 144 b 2 ce 2 

-27d 4 + 144 cd 2 e - 128 c 2 e 2 - 192 6cfe 2 + 256 e 3 (3.5) 

= -Ap 3 q 2 - 27 g 4 + 16p 4 r+lUpq 2 r - 128p 2 r 2 + 256 r 3 . (3.6) 

We also define the square root of A: 

5:= [J (a i -a i )= J] (ft - ft) = (3.7) 

!<«<i<4 i<«<i<4 

Again, the sign of 5 may change if we permute ai, . . . ,014; the sign is pre- 
served by an even permutation but is changed by an odd permutation. 

Let E = K(ai, . . . , 04) = K(0i, . . . ,Pi) be the splitting field of /, or 
g, over K, and let G := Gal(E:K) be the Galois group of the extension 
E D K. The elements of the Galois group G permute the roots 04 (or Pi), 
and G may be regarded subgroup of S4. 

S4 has a normal subgroup V consisting of the 4 permutations 1 (identity) 
and (12) (34), (13) (24), (14) (23). Thus G has a normal subgroup GDV. Let 
the fixed field of Gfl V be F. Then F is a Galois extension of K with Galois 
group G/G n V C 5 4 /y = S 3 . 

Fourier analysis on 1/ is especially simple because every element has order 
1 or 2, and thus every character is ±1 (again, see e.g. 0] or [iotp. We identify 
functions on V by vectors {zi, z 2 , Z3, Z4), with z\ the value at t, and define 
the Fourier transform of (zi, z 2 ,z 3 , Z4) as (zi, 22, S3, S4) with 



Si := 2(^1 + z 2 + 23 + 24), (3.8) 

% ■= \{zi + z 2 - z 3 - z 4 ), (3.9) 

23 := §(*l - z 2 + z 3 - z 4 ), (3.10) 

Z4 := - z 2 - z 3 + z 4 ). (3.11) 



For V, with our chosen normalization, the Fourier inversion formula takes 
the especially simple form z = z, i.e., the Fourier transform is its own inverse. 
(This is easily verified directly.) 
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Since fa+fa+fa+fa = 0, the Fourier coefficient fa = 0. For convenience, 
we shift the indices and define 7, := fa+i, i = 0,1,2,3, where thus 70 = 
0. The Fourier transforms of (a%, a.2, 03, a.4) and (fa, fa, fa, fa) are thus 



(-^,71,72,73) and (0,71,72,73), where 

71 := fa := \{ai + a 2 - a 3 - 04) = /3i + fa = -fa - fa, (3.12) 

72 := £3 := - «2 + «3 - "4) = /3i + /3 3 = -/3 2 - At> (3.13) 

73 := #4 := - «2 - «3 + «4) = Pi + fa = -/3 2 - /3 3 . (3.14) 



Permutations in V act on the vectors (regarded as functions on V) by 
translations in V, and thus on the Fourier transforms by multiplying by 
characters, which are ±1. In other words, permutations in V act on 71, 72 
and 73 by multiplying by ±1 (as is easily seen directly from (|3.12p - (|3.14p ). 
Consequently, if we define 

u := 71 2 = (fa + fa) 2 = (fa + fa) 2 , (3.15) 

v := 7 | = (fa + fa) 2 = (fa + fa) 2 , (3.16) 

w := 73 2 = (fa + faf = (fa + fa) 2 , (3-17) 

then u, v and w are fixed by V n G, and the thus belong to the fixed field 
F. We can easily find them explicitly. If a is any element of the Galois 
group G, then a permutes fa, . . . ,fa, and it follows from ()3. 15[) (|3. 1T|) that 
a permutes u, v, w. Hence any symmetric polynomial in u, v, w is fixed by 
every a G G, and thus it belongs to K. In particular, this applies to the 
coefficients of the polynomial R(x) := (x — u)(x — v)(x — w), which thus has 
coefficients in K. Calculations yield the explicit formulas 

u + v + w = -2p, (3.18) 

uv + uw + vw = p 2 — 4r, (3.19) 

uvw = q 2 . (3.20) 

Hence, u, v, w are the three roots of the cubic resolvent 

R(x) := (x — u)(x — v)(x — w) = x 3 + 2px 2 + (p 2 — Ar)x — q 2 G K[x\. (3.21) 

Remark 3.1. Note that u-v = (fa- fa)(fa- fa), u-w = (fa-fa)(fa-fa), 
v — w = (fa — fa) (fa — fa)- Hence the discriminant (u — v) 2 (u — w) 2 (v — w) 2 of 
the cubic resolvent R equals the discriminant of g or / given by ()3.5p ~ ()3.6p . 

Having found u,v,w, we take their square roots to find 71,72,73- By 
(ED , 717273 = ±Q- In fact, using (^2D - (l3TT41) and fa + fa + fa + fa = 0, 

717273 + Q = 717273 - (fa fa fa + fa fa fa + fafafa + fa fa fa) 

= -(fa + fa)(fa + fa)(fa + fa) + (fa + fa + fa)(fafa + fa fa + fa fa) 
- fafafa 

= 0. 
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Hence, 

717273 = -q, (3.22) 

which provides the information we need on the signs of 71 , 72 , 73 • We then 
find Pi, ... , /?4 by taking the (inverse) Fourier transform of (0, 71, 72, 73)- We 
summarize the resulting algorithm: 

Theorem 3.2. Let g(y) = y 4 +py 2 + qy + r, and define the cubic resolvent 
R(x) by (|3.2ip . Let the roots of R by u,v,w (for example found by Theo- 
rem \2. 5\) . and let 71 := y/u, 72 := \fv, 73 := yfw, where we choose the signs 
so that 717273 = —q. Then the roots of g are given by 

Pi = 5(71 + 72 + 73), (3.23) 

Pi = i(7i -72 -73), (3.24) 

/3 3 = ^(-7i+72 - 73), (3.25) 

/3 4 = ^(-7i-72 + 73). (3.26) 

The roots of f{x) = x + 6x 3 + ex 2 + dx + e are on = Pi — b/A, i = 1, . . . , 4, 
where g(y) := f(y - 6/4). 

Note that changing the signs of some of 71 , 72 , 73 while still preserving 
(|3.22p (i.e., changing the sign of exactly two of them), just yields a permu- 
tation of Pi , . . . , /?4. 

Remark 3.3. The formulas f|3.23p — (]3.26j) were given by Euler @, §5] m 
1733. Euler's motivation was different. For the cubic, Cardano's formula is 
tyU + \fV where U := u 3 and V := v 3 are roots of the quadratic resolvent 
(|2.37p . Further, for a quadratic x 2 = a there is the trivial formula \fa. 
Hence, Euler sought by analogy a formula for the roots of a quartic in the 
form \/A+\/B + \/C, and found a cubic equation for A, B, C by substituting 
in g(y) = 0, see [5j for details. (In our notation, A = n/4, B = v/A 
and C = v/A, so Euler's cubic equation is our R(Ax) = 0; the difference 
from our cubic resolvent equation R(u) = is thus only a trivial matter of 
normalization.) 

Euler [5, §§6-8] proceeded to write the solution as \f~E + \f~F + \[G (with 
E = A 2 , F = B 2 , G = C 2 ), and found another cubic equation satisfied by 
E, F, G. Euler conjectured that similar formulas existed for higher degrees 
too, and in particular that the roots of a fifth degree equation could be found 
as \YA + \Yb + \jC + v^D, where A,B,C,D were the roots of some fourth 
degree resolvent; however, he could not find such a resolvent. Of course, 
we know that Euler's conjecture cannot hold, since 100 years later it was 
proved by Abel and Galois that in general there is no solution by radicals 
for a fifth degree equation. 
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Remark 3.4. A simple calculation yields, by (pT23V(|3T26l) and (|3TTS|) . 

P1P2 + foP* = t (7? " (72 + 7s) 2 + 7? " (72 - 7s) 2 ) 

\ 1 (3-27) 
= 4(27? " 27I - 27I) = -{u-v-w) = u+p 

and similarly 

/3i/3 3 + /3 2 /34 = t>+P, (3.28) 

/3i/? 4 + /3 2 /3 3 = w + p. (3.29) 

Hence, the roots of the cubic resolvent are the three values of /3i(3j + flkfli —p 
for different permutations ijkl of 1234. 

For the roots of / we have, since ctj = /3j — 6/4 and /3i + $2 + /3 3 + ,$4 = 0, 

6 2 6 2 
aia 2 + a 3 a 4 = /3i/3 2 + /3 3 /3 4 + — = u + p + — , (3.30) 

o o 

and similarly a^o^ + 0204 = v + p + b 2 /8, a\a^ + 0203 = u> + p + 6 2 /8. 

Remark 3.5. Another method to solve the quartic equation x 4 + bx 3 + 
ex 2 + dx + e = 0, also due to Lagrange 171 ]. is to form (cf. Remark l3.4p 

si := ai«2 + a 3 a4, S2 := aia 3 + 0204, s 3 := ai«4 + ct2a 3 , (3.31) 

and the cubic polynomial R{z) = (z — s\)(z — S2){z — S3) with these as roots. 
This polynomial can be expressed in the coefficients of the equation as, see 
0, Section 12.1], 

R(z) = z 3 - cz 2 + (bd - Ae)z - d 2 - b 2 e + Ace. (3.32) 

By Remark E31 and ([3T2Tj) . R(z) = R(z-p-b 2 /8), so this is our usual cubic 
resolvent in disguise. 

Having found si,S2,s 3 by solving the resolvent equation R(s) = 0, one 
notes, for k = 1, 2, 3, 

(2 7fc ) 2 - 4s k = b 2 - 4c, (3.33) 

and thus 

7fc = ±^ Sfe - c + 6 2 /4, (3.34) 
which, recalling (I3321) . yields the roots by (l3T23]) - (l336l) . 

Appendix A. Real cubic equations 

Assume that f(x) = x 3 + ax 2 + 6x + c is a polynomial of degree 3 with 
real coefficients. 

Then / has always 3 complex roots (not necessarily distinct, and as always 
given by Theorem I2.5p . but the number of real roots may be smaller. The 
following theorem shows that the number of real roots is 1 or 3, and that 
the discriminant discriminates between the possible cases. 

Theorem A.l. Let A be the discriminant of f given by (|2.10p .- thus A is 
real. 
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(i) If A > ; then f has three simple real roots. 

(ii) If A < 0, then f has one simple real root, and two non-real complex 
roots forming a pair of complex conjugates. 

(iii) If A = 0, then f has either one double and one simple real root, or 
a triple real root. 

Proof. Let, as in Section [21 the 3 roots of / in C be a±, a 2 , 03- Note that 
ai + a 2 + «3 = -b. A is real by pTTD]) . By (i2T9l) . A = if and only if two 
of the three roots coincide, i.e., / has a double or triple root. This root has 
to be real, since otherwise its conjugate would be another double or triple 
root and / would have at least 4 roots (counted with multiplicity), which is 
impossible. If there is a real double root ai = a 2 , then 0,3 = — b — ati — a 2 
is real too. This shows (iii). 

Now suppose that A / 0; thus / has three distinct simple roots in C. 
Since f(x) — > —00 as x — > —00 and f(x) — > 00 as x — > 00, / has at least one 
real root by the intermediate value theorem. Further, since the roots are 
simple, / changes sign at each root, so / has an odd number of real roots. 
Hence / has either 1 or 3 real roots. 

If / has 3 distinct real roots qi, a 2 , «3, then (|2.9p yields A > 0. 

If / has only one real root, then the roots are ai, a 2 , 02 with ai € R and 
a 2 i R, and <^M) yields 

A = [ai — a 2 ) 2 (ai — lx 2 ) 2 {a 2 — a^) 2 = \cti — a2| 4 (2ilm a 2 ) 2 < 0. □ 

Remark A. 2. For a cubic y 3 +py + q without second degree term, A/108 = 
— (p/3) 3 — (q/2) 2 by (|2.10p . and thus the criterion for case (i) is (p/3) 3 + 
(q/2) 2 < 0; equivalents, p < and |p/3| 3 > \q/2\ 2 . This was found already 
by Cardano, see Remark IB. 41 

The number of real roots is thus easily found. Now let us consider finding 
the root(s). There are by Theorem lA.ll three cases, which we treat separately 
since they turn out to be quite different. (Of course, the roots are always 
given by Theorem 12.51 but we now want to perform only real arithmetic, if 
possible.) 

A.l. A = 0, a double or triple root. In the case A = 0, the roots 
are easily found. The double (or triple) root ai = a 2 is also a root of the 
quadratic equation f'(x) = (choosing the root that also satisfies f(x) = 0), 
and then 03 is given by 03 = —b — 2ai. It is easily seen that ai = a 2 = 
—6/3 + 5, and thus 03 = —6/3 — 25, where 5 = i^—p/3 with the correct 
sign given by sign(<5) = sign(g). (See also Remarks 12.71 and IA.51 ) 

A. 2. A < 0, one simple real root. If A < 0, then Cardano's formula 
(I2.36P yields the unique real root of / by choosing the real cube roots. Note 
that —q/2 ± y /— A/108 is real and that the product of the real cube roots 
in (|2.36p is — p/S as required, because the product is a cube root of (— p/3) 3 
and — p/3 is real. 
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A. 3. A > 0, three simple real roots: casus irreducibilis . The case 
A > is much more complicated. Of course, Cardano's formula (]2.36p still 
applies, but now a/— A/108 is imaginary and —q/2 ± y 7 — A/108 complex, 
so the formula necessarily involves taking cube roots of complex (nonreal) 
numbers, even though we know that the final answer is a real root of /; the 
three different choices of cube roots of —q/2 + \J — A/108 lead to the three 
different real roots of fix) = 0. 

In this case the imaginary parts thus cancel in (|2.36[) for any admissible 
choice of cube roots. This can also be seen as follows: since — q/2+^J — A/ 108 
and —q/2 — \J — A/108 are complex conjugates, we may, and have to, choose 
cube roots of them that are complex conjugates in (|2.36p ; recall that the 
product of these cube roots has to be —p/3, which is real. Hence, Cardano's 
formula ()2.36|) for the roots may be written 





(A.l) 

Every complex number may be represented by a pair of real numbers, 
viz. its real and imaginary parts, but taking the cube root of a complex 
number may not be reduced to a combination of real cube roots (or real 
square and higher roots) and usual algebraic algebraic operations. In fact, 
it can be shown by Galois theory that if / is any polynomial with rational 
coefficients (i.e., f(x) G Q[zc]) such that / is irreducible over Q and has 
positive discriminant, and a is a root of /, then a cannot be expressed 
by real radicals; in other words, there does not exist a sequence of field 
extensions Q = Fq C F± C F2 ■ ■ ■ C Fjy where F^ = Fk-i[uk] for some real 
Uk with u? k £ Fk-\ for some positive integer n&, k = 1, . . . , N, and a E F/v; 



sec 



21, Section 8.8] or 0, Section 8.6]. (Here Q may be replaced by any 
subfield of M.) 

The case A > is known as the casus irreducibilis; in this case, thus the 
equation cannot (in general) be solved by radicals using only real numbers 
(somewhat paradoxically, since the answers all are real). This case is, by 
(|2.10p . characterized by — 4p 3 > 27 q 2 , or equivalently 

p < and 4|p| 3 > 27q 2 > (A. 2) 



or 

p < and |p/3| 3 > (q/2) 2 . (A.3) 

An alternative to Cardano's formula (|A.1|) in the case A > are the 
following trigonometric formulas, which involves only real numbers but use 
transcendental functions instead of algebraic expressions. 

Theorem A.3. If the real cubic polynomial f(x) = x 3 + bx 2 + cx + d has 
positive discriminant A > 0, or equivalently, (IA.3j) holds, then f has three 
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real roots given by 

where different branches of arccos yield the three different roots. 
Equivalently, the roots are given by the three different values of 



Proo/. Let z := u 3 = -q/2 + y'-A/lOS = -q/2 + i^/A/108, see flZ3S]). 
Then, using ipTTOjl . 



9 A ^ / \ 3 

' 4 108 ~~ 27 ~ 



We write z in polar coordinates: z = re llp ', where thus r = |z| = (|p|/3) 3//2 
and 

_ Rez _ -<?/2 _ -g/2 
COSl ^ " "H" ~ ~T~ ~ (-P/3) 3 / 2 ' 
By (jA.lh . a root of / is given by 

-^ + 2Rez 1/3 = -■^ + 2Rer 1 / 3 e iv/3 = — + 2V 1 / 3 cos(^/3), 

which yields fjA.4h . with different choices of ip yielding the three roots. 

To see (IA.5j) . let tp := tp + 37r/2 and note that simp = — cosp and 
sin('0/3) = cos(ip/3). □ 

In (|A.4j) and (|A.5P we find the different roots by choosing different values 
of arccos or arcsin. Often it is more convenient to make a single choice (for 
example the principal value with < ip < it or — n/2 < Tp < n/2, but any 
choice will do). 

Theorem A. 4. Suppose that the real cubic polynomial f(x) = x 3 + bx 2 + 
cx + d has positive discriminant A > 0, or equivalently, that ()A.3P holds. 
Then, for any choice of 

* " { {-p% 2 ) ' (A ' 6) 

/ has three real roots 
Similarly, for any choice of 
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/ has three real roots 

Proof. The other possible values of arccos in (|A.6p are ±<p + 27rre, n G Z. 
Hence, the three values of the cos in ()A.4p are cos(c^/3) and 

M ± 27T (/3 27T (/J 2lT 1 if ^/3 if 

cos = cos — cos — =F sin — sin — = — cos — =F — sin — , 

3 3 3 3 3 23 2 3 

and (TO|) yields (CO) . 

Similarly, the other possible values of arcsin in ()A.8|) are ip + 2irn and 
3tt — vp + 2irn, n £ Z, and the three values of the sin in (IA.5|) are sin(i/>/3) 
and 

V>±2VT V 271 " ^ 2VT 1 . "0 , ^ rn 

sin = sin — cos — ± cos — sin — = — sin — ± — cos — . □ 

3 33 332323 

Remark A. 5. The formulas (|A~4"l) - (jAlJ]) are meaningful (with real quanti- 
ties only), exactly when p < and \q/2\ < \p/3\ 3 ^ 2 , i.e., when (|A.3|) holds 
or in the limiting case A = and p / 0. The formulas (|A.4|) - (|A.9P are 
valid in the latter case too, and then yield the roots —6/3 + S, —6/3 + 5, 
—6/3 — 25, where 5 = i-y/— p/3 with sign(<5) = sign(g), as found more easily 
in Subsection lA.il 

Example A. 6. Let f(x) = x 3 — x, which evidently has the three real roots 
0, ±1. 

We have 6 = 0, p = c = -1, q = d = 0, and, by (|2.10p . A = 4 (which is 
verified by §ZS§). Hence, u 3 = -q/2 + ^/-A/108 = y/-l/27 = 3" 3 / 2 i, and 
we find the three cube roots 

U1 = 7T 

-i / 1 V3\ 1 i 
U2 = -t= -77 = 77 + 



v^^ 2 2 / 2 2^3' 
-iy I .V3\ 1 i 



v^^ 2 2 J 2 2x/3' 
Hence (|2.35p and (|2.36p yield the three roots of / as 

Ml+MT = 0, U2 + U2 = l, U3 + U3 = — 1. 

Alternatively, we may use the trigonometric formula (lA.4p . We have 
arccos [—(q/2)/ (— p/3) 3//2 ) = arccos = 7r/2 + n7r, n G Z, and thus the three 
roots are 
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Similarly, (|A.5j) yields, since arcsin((g/2)/(— p/3) 3 / 2 ) = arcsinO = mr, 
n£Z, the three roots as 

2 2 2vr 2 4vr 

-^sinO = 0, — = sin — = 1, — = sin — = — 1. 

y/3 \/3 3 ^3 3 

Using (|A.7p with if = tt/2 we find the same roots given as 

2 7T 1 / 7T /- . 7T 

— = cos — , = cos — ± v o sm — 

^3 6' V3V 6 6 

while (|A.9P with ip = yields 

2 1 / \ 
-^sinO = 0, = sin0±\/3cos0 ) = =pl. 

Example A. 7. Let f(x) = x 3 — 7x — 6, which has the roots 3, —1, —2. 
We have p = -7, q = -6 and A = 400. Thus, ti 3 = -q/2 + ^/-A/108 = 
3 + y 7 — 100/27 = 3 + p^-i, and we find the three cube roots 

2 . 15. 31. 

Ul = — 1 H =1, Mo = =1, to = 1 =1. 

y/3 2 2^3 2 2^3 

Hence, (|2.35p and (|2.36p yield the three roots of / as 

2Reui = -2, 2Re«2 = -l, 2Reu 3 = 3. 

The trigonometric solution (|A.4|) yields 



/7 /l /243 2vrA; \ . 

2 V 3 C ° S I 3 arCC ° S V 343 + ~ J ' k = > 1 > 2 > ( A - 10 ) 

and it is far from obvious that this yields three integers 3, —2, —1. 

Example A. 8. Let f(x) = x 3 — 7x 2 + 14x — 8, which has the roots 1, 
2, 4. Then, by jg^-gju]) , p = -7/3, g = -20/27 and A = 36. Thus, 
u 3 = —q/2 + y 7 — A/108 = Tff + "^|> an d we find the three cube roots 

2 1 1^/3 5 1 

Ul = -3 + V = 3^ U2 = -6-V' M3 = 6 + 27I 1 - 

Hence, (|2.36p yields the three roots of / as 

7 7 7 

- + 2Reui = l, - + 2Reu 2 = 2, - + 2Reto = 4. 

3 3 3 

The trigonometric solution (1A.4P yields 

7 2y/7 (I 10 2itk\ , , A . 

- + -y cos ^- arccos — =. + — J , A; = 0,1,2, (A.ll) 

which, again surprisingly, yields three integers 4, 1, 2. 



16 



SVANTE JANSON 



Example A. 9. Cardano saw the problem with the casus irreducibilis and 
asked Tartaglia about it, giving x 3 = 9x+10 as an example, see Remark IB. 41 
In this case, thus f(x) = x 3 — 9x — 10, so p = —9, q = — 10 and A/108 = 
(-p/3) 3 - (q/2) 2 = 2. Thus, u 3 = -q/2 + ^/-A/108 = 5 + >/2i, and we find 
the three cube roots 

ui = -l + V2i, u 2 = — ^ 1 2 *' U3 = — 2 2 

Hence, (|2.35p and (|2.36p yield the three roots of / as 

2Reui = -2, 2Ren 2 = l + \/6, 2 Re u 3 = 1 - V6. 

Example A. 10. Cardano [3j, Chapter XIII] considered also the equation 
y 3 = 8y + 3. He saw that y = 3 is one solution (without discussing the 
problem of the casus irreducibilis, see Remark IB.4|) . In modern terms he 
then found the other two solutions by finding the roots of the quadratic 
polynomial (y 3 — 8y — 3)/(y — 3) = y 2 + 3y + 1; he gave a general formula 
for this. (The other two solutions are —(3 ± v5)/2; these are negative, and 
Cardano changes the sign and interprets the result (3 ± y/E)/2 as the two 
positive solutions of x 3 + 3 = 8x.) 

Let us instead use Cardano's formula. In this case, p = —8, q = —3 and 
A = -4p 3 - 27q 2 = 1805. Thus, 



u 



3 



19\/5. 



-q/2 + V-A/108 = - + — ^=-i, (A.12) 

and we find the three cube roots 

3 y/E V5-3 9 + V5 V5 + 3 9 - \/5 

iti = — I ^i, U2 = ; ^— ii 1*3 = ; 1 

2 2^3 4 4^/3 4 4^3 

Hence, (|2.35f) and (|2.36[) yield the three solutions of y 3 = 8y + 3 as 

2Reui = 3, 2Reu 2 = -(3- \/5)/2, 2Reu 3 = -(3 + VE)/2. 

Example A. 11. Bombielli (1550) considered the equation y 3 — 15y — 4 = 0. 
Cardano's formula (|2.35|) yields the roots as 



2 + V-121 + V 2 - V-121 = \/2 + Hi +\/2- Hi. (A.13) 

Bombielli noted that 4 is a root, and showed in a pioneering calculation with 
complex numbers that (2 ± i) 3 = 2 ± Hi, and thus (|A.13P correctly yields 
the root (2 + i) + (2 - i) = 4. 

The two other cube roots of 2 + 11 i are 

y/3 2^/3-1 \/3 -2^/3-1 

Uo = — 1 1 1, to = — 1 H 1 1. 

i 2 2 2 2 

Hence, the three solutions of y 3 — Voy — 4 = are 4 and — 2 ± y3- 
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A. 4. Further comments for real coefficients. 

Remark A. 12. In the casus irreducibilis we thus always obtain the roots as 
complicated expression involving complex cube roots, even when the roots 
are, for example, simple integers. (See Examples I A . 61fA. 1 1 1 for simple cases.) 

Also in the case of a single real root, Cardano's formula typically yields 
complicated expressions (but now involving real roots only) also for simple 
integer solutions. 

Example A. 13. The equation x 3 + 6x = 20 0, Chapter XI] has the real 
root x = 2 (and the complex roots —1 ± 3i). Cardano's formula yields the 
root as 

10 + VW8 + \JlO-VW8 = yth/M - y/VW§- 10 (A.14) 

This indeed equals 2 because y/108 ± 10 = (y/3 ± l) 3 , but this is far from 
obvious and it is hard to see how (|A.14j) can be simplified without knowing 
the answer. 

Remark A.14. The trigonometric formulas (|A.4p - (|A.9|) are valid also when 
A < (and, more generally, for arbitrary complex coefficients with p ^ 0), 
but then the angles ip and ip are complex and the formulas less useful. 

For real coefficients with A < (the case of a single real root), we can 
choose (f in (|A.6|) or ip in (|A.8[) purely imaginary (after a change of sign of 
the roots and q in the case p < < q), and the formulas (IA.7j) and (IA.9jl 
can be rewritten with real hyperbolic functions as follows [121 ]. 

Theorem A. 15. If the real cubic polynomial f(x) = x 3 + bx 2 + cx + d has 

negative discriminant A < ; then f has one real and two conjugate complex 
roots given as follows. 

(i) If p < and q/2 < — |p/3| 3//2 ; then the roots are 

b n f-P u f l uf 'I/ 2 \\ 

h 2\ — cosn - arccosn ■ — ^7- , 

3 V 3 \s V(-p/3) 3 /v; 

-I - ^ h G — fc^O) ± «™ G — ■ (^) 

(ii) If p < and q/2 > \p/3\ 3 ^ 2 , then the roots are 

-\ - cosh (5 arccosh (f^)) ' 

-I + ^ ( cosh (i ^ (_^) ) ± ^ (I _ h (_^) 

(hi) If p > 0, £/ien i/ie roots are 
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Remark A. 16. The three cases in Theorem lA.ll can also be se en g eomet- 
rically by considering the graph of / (or g) as follows (based on |18|]). 

Denote the stationary points of g, i.e. the roots of g'(y) = 0, by ±5; thus 
the stationary points of / are — | ± S. (Note also that / has an inflection 
point at (— §,#).) Since g'(y) = 3y 2 + p, we have 

5 = V^P/S. (A.15) 

In particular, 5 is either real (p < 0) or imaginary (p > 0); when p < 0, we 
choose for convenience the positive square root in (|2.1ip . Let further 

h:=-(g(5)-g(0)) =-{5 3 +p5) = 25 3 . (A.16) 

(We choose this sign so that h > when 5 > 0.) Thus 

f(-l±5) = g(±5)=g(0)Th = qTh. (A.17) 

If 5 > 0, then / thus has a local maximum at — | — e) with value q + h, 
and a local minimum at — | + <5 with value q — h. Considering the graph of 
/, we see that f(x) = then has three real roots if 6 (q — h,q + h), two 
real roots (of which one double) if = q ± /i, and one (simple) real root if 
^ [g — /i, (? + h] . We thus see the three different cases in Theorem IA.H with 
(i) if h > \q\, i.e. h 2 > q 2 , (ii) if h < \q\, i.e. h 2 < q 2 , and (hi) if h = \q\, i.e. 
h 2 = q 2 . 

In the limiting case 5 = (which entails h = 0), / has no local maximum 
or minimum, but a saddle point at — | with /(— |) = <?(()) = g. In this case 
there is a triple root (case (hi)) if q = and otherwise one simple real root 
(case (ii)). 

If (5 is imaginary (and non-zero), then f'(x) ^ for all real x, and thus 
f'(x) > (since this certainly holds for large x); hence / is strictly increasing 
and f{x) = has a single, simple root for any q (case (ii)). In this case, h 
is imaginary too, so h 2 < < q 2 . 

We thus see that in all cases, Theorem I A. II holds with case (i) when 
h 2 — q 2 > 0, (ii) when h 2 — q 2 < 0, and (hi) when h? — q 2 = 0. This is also 
confirmed by a simple calculation showing that 

A = -V - 27 q 2 = 108<5 6 - 27 q 2 = 27 {h 2 - q 2 ). (A. 18) 

Using parameters 5 and h also simplify the formulas above a little. Since 
h 2 = 4<5 6 = -4p 3 /27, (gjjgp becomes 

u 3 = l(-q+y/tf—tf) (A.19) 
so Cardano's formula ()2.36j) for the roots of / becomes 

- \ + y^HW?^)" + v/^-g-x/^ 7 ^)- (A.20) 
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Similarly, in the casus irreducibilis, which now is characterized by h > \q\, 
(fOl) and (|X5|) can be written 

— - + 25 cos f— arccos^— j^j = ~~ ^ + 25 sin f— arcsin^^ . (A.21) 

Remark A. 17. Consider the case of a real cubic f(x) = ax 3 + bx 2 + cx + d 
with a single real root (A < 0). A geometric way to find the two complex 
roots from the graph of / (on R) has been given by e.g. (13 : 11]: Let A be 



the intersection of the curve y = f(x) and the x-axis (i.e., the real root), and 
let £ be a tangent from A to the curve. If the tangent point has x-coordinate 
Xq, and the slope of the tangent is k, then the complex roots are x$ ± \f~ki. 

Appendix B. Histor\0 

The solution to cubic equations was first found c. 1515 by Scipione del 
Ferro (1465-1526) in Bologna, at least for some cases. At this time, negative 
numbers were not used, nor was 0. Thus (in modern notation) x 3 + px = 
q, x 3 = px + q and x 3 + q = px, with positive p and q, were regarded 
as different types of equations. (The third type was often ignored. We 
know today that it always has one negative solution, which generally was 
disregarded, and either zero positive solutions or two (casus irreducibilis); 
hence Cardano's formula will never yield a positive solution using real roots 
only. The negative solution was treated by Tartaglia and Cardano by, in 
modern terms, changing the sign of x, which transforms x 3 + q = px to 
x 3 = px + q.) Scipione del Ferro could solve the first type and possibly the 
second (researchers disagree). 

Remark B.l. There are 13 types of nontrivial cubic equations with positive 
coefficients: x 3 + cx = d, x 3 = cx+d, x 3 + d = cx, x 3 = bx 2 + d, x 3 + bx 2 = d, 
x 3 + d = bx 2 , x 3 + bx 2 + cx = d, x 3 + cx = bx 2 + d, x 3 + bx 2 = cx + d, 
x 3 = bx 2 + cx + d, x 3 + d = bx 2 + cx, x 3 + cx + d = bx 2 , x 3 + bx 2 + d = cx. 
These are, for example, discussed separately in Cardano's Ars Magna 0, 
Chapters XI-XXIII]. 

Similarly, Cardano 0, Chapter V] considers three different types of qua- 
dratic equations: x 2 = bx + c, x 2 + bx = c, x 2 + c = bx (as did al-Khwarizmi 
c. 800, while Brahmagupta in 628 used both zero and negative numbers 
and treated all quadratic equations together), and many types of quartic 
equations, see Remark IB.5I below. 

Note that Cardano discusses negative numbers and negative solutions 
(called "false solutions" ) to equations [3j, in particular Chapters I and XXXVII] ; 



^This appendix is largely based on the Foreword (by Oystein Ore) and Preface (by T. 

Richard Witmer) to the English translation of Ars Magna [3|, the text itself of Ars Magna 
j|, Katscher [TBI: [l6|. van der Waerden (22|, and The MacTutor History of Mathematics 
19( | on the Internet (the articles Quadratic, cubic and quartic equations; Scipione del 

Ferro; Nicolo Tartaglia; Girolamo Cardano; Lodovico Ferrari; Tartaglia versus Cardan); 

much more details can be found in these references. For the history after Cardano's Ars 

Magna see van der Waerden [2^ ]. 
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however, he does not consider negative coefficients (at least not usually, al- 
though there are occasional uses in a few examples, for example [31, Chapter 
XXXIX, Problem IX]). 

Cardano even makes a pioneering tentative use of imaginary numbers 
and complex solutions 0, Chapter XXXVII, Rule II], although he clearly 
does not understand them and he seems sceptical to his calculation. Com- 
plex numbers were introduced in a consistent way somewhat later by Rafael 
Bombelli (1526-1572) in his book Algebra (1572), where he also shows how 
to work with negative numbers fli^ . Rafael Bombielli]. 

Remark B.2. It is claimed in (lil . Scipione del Ferro] that the reduction 
(|2.2p to an equation without quadratic term (which seems trivial to us) 
was known at the time of del Ferro, but this seems incorrect, and I rather 



believe the claim by [15(] that del Ferro considered only such cubics because 



the others were too difficult to be solved. See further Remark lB.31 

However, del Ferro kept his solution secret. The traditional story is that 
he did not tell anyone about it until his deathbed in 1526, when he told the 
solution to his student Antonio Maria Fior. (This seems a bit exaggerated, 
since his son-in-law Hannibal della Nave much later showed Cardano a note- 
book written by del Ferro presenting the solution, but he certainly told very 
few.) 

Fior let it become known that he could solve cubic equations (without 
disclosing the method). This prompted Nicolo Tartaglia (1500-1557) in 
Venice to find solutions. He first found a solution to some equations of 
the type x 3 + bx = d. He claims [16, XIIII p. 12, XXV p. 15, p. 64] that 
he found the solution to all such equations in 1530, but he really could 
solve (and construct) only special cases, in modern terms having a negative 
integer solution. A public contest was held between Fior and Tartaglia in 
1535, where each was to solve 30 problems set by the other (within 40 or 
50 days); according to himself 0, XXV p. 13], Tartaglia managed to find 
the solutions to the two types x 3 + px = q and x 3 = px + q on 12 and 13 
February 1535, only 8 days before the deadline of the contest!!, and then 
Tartaglia easily won by solving all 30 problems in 2 hours. (Fior's problems, 
which are given in [161 . XXXI pp. 29-31], were all of the type x 3 + px = q, 
which he did not believe that Tartaglia could solve.) 

Girolamo Cardano (1501-1576) in Milan then invited Tartaglia, and man- 
aged to make him disclose the method (25 March 1539), after Cardano 
had promised Tartaglia to keep it secret until Tartaglia had published the 
method himself (something Tartaglia never did, preferring to keep it secret 
and regretting that he had told Cardano). Cardano worked on the solution 
together with his young assistant Lodovico Ferrari (1522-1565), who in 1541 
found a solution to quartic equations. 



According to [H, XXXI p. 29], the contest was on 22 February, which yields a dis- 
crepancy in the exact dates. 
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Cardano found out that the cubic equation had been solved by del Ferro 
before Tartaglia, and used this as an excuse to break his promise to Tartaglia 
and publish (in 1545) the solutions of cubic and quartic equations in his large 
algebra book Ars Magna [H, where they form a major part. (All 13 types 
of cubic equations are discussed separately in detail, but only some of the 
possible quartic equations, see below.) Cardano introduces the solution of 
the cubic equation with: 

Scipio Ferro of Bologna well-nigh thirty years ago discovered 
this rule and handed it on to Antonio Maria Fior of Venice, 
whose contest with Niccolo Tartaglia of Brescia gave Nic- 
colo occasion to discover it. He [Tartaglia] gave it to me in 
reponse to my entreaties, though withholding the demonstra- 
tion. Armed with this assistance, I sought out its demonstra- 
tion in [various] forms. This was very difficult. My version 
of it follows. @, Chapter XI] 

The publication led to a bitter dispute between Tartaglia and Cardano- 
Ferrari. Tartaglia accused in a book [la . XXXIIII p. 42] (1546) Cardano 
of breaking an oath to him to keep the solution secret; he also added some 
insults against Cardano. This led to a series of equally insulting pamphlets 
(6 each) by Ferrari (defending Cardano, who kept a low profile in the dispute) 
and Tartaglia (renewing his accusations and insults), and finally to a public 
contest between Tartaglia and Ferrari in Milan on 10 August 1548. (Each 
posed 62 problems to the other. Ferrari won clearly; Tartaglia left Milan 
after the first day of the contest, when he saw that he was losing.) 

Remark B.3. del Ferro, Fior and Tartaglia (with the exception x 3 +bx 2 = d 
discussed above) considered only cubics without second degree term, see 
Remark IB.2| It seems that the reduction (I2.2p of general cubic equations 
to this case is due to Cardano, who in 0] uses this reduction in 9 of the 
10 types with a quadratic term (the exception is x 3 + d = bx 2 , which is 
reduced by the substitution x = d 2 / 3 /y). (Cardano claims in the beginning 
of [§] that those things to which he has not attached any name are his 
own discoveries. This is of course no proof that this reduction is his own 
invention, but it suggests that he regarded the reduction either as his own 
contribution or trivial.) Note that Cardano does the reduction separately for 
each type and that he does not discuss the reduction in his earlier chapters 
on some transformatons of equations. Moreover, he surprisingly does not use 
the corresponding reduction for fourth degree equations (see Remark IB.5|) . 
Furthermore, Tartaglia did not know this reduction (until he read 0]); note 
that Tartaglia himself only mentions cubics without second degree term in 
the poem that he later claimed that he gave Cardano with the solution (see 
Remark IC.ip . and that when he claims to have solved x 3 + bx 2 = d in 1530, 
he says that he had not been able to solve x 3 + bx 2 + cx = d [l6l . XIIII p. 12] 
(and there is no indication that he found a solution later). 
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Remark B.4. Cardano quickly realized the problem with the casus irre- 
ducibilis, see AppendixEl and wrote to Tartaglia about it on 4 August 1539 
XXXVIII p. 48], giving the correct condition for it (see Remark IA.2jl 
and giving x 3 = 9x + 10 as an example (see Example IA.9j) . Tartaglia was 
no longer cooperative, but it seems that neither Cardano nor Tartaglia un- 
derstood how to handle this case. 

Cardano ignores the complications of the casus irrcducibilis in Ars Magna 
0| . In 0, Chapter XIII] he solves y 3 = 8y + 3, and claims that he obtains 
y = 3 (which clearly is a solution) by his method, which seems to be at best 
an oversimplification. (Cf. Example IA. 101 ) 

Remark B.5. Cardano lists [3j, Chapter XXXIX] 20 types of quartic equa- 
tions that he can solve; these are the 10 nontrivial cases without cubic term 
(excluding the ones with only even powers of x, which are quadratic equa- 
tions in x 2 ) and, symmetrically, the 10 nontrivial cases without linear term 
(which are reduced to the former by inversion). 

Cardano states that these cases "are the most general as there are 67 
others" ; I do not understand which these 67 other cases are. Moreover, there 
are 15 cases with all possible terms (cubic, quadratic, linear and constant), 
and 7 additional without quadratic terms; these are not mentioned as far as 
I can see. 

Cardano gives several examples where quartic equations are solved by 
Ferrari's method (see Appendix [D]); these examples illustrate 4 of the 10 
types without cubic term and 2 of the 10 types without linear term, and it 
is clear that the method applies to all 20 types. 

There is also a single example of an equation with both linear and cubic 
terms (x 4 + 2x 3 = x + 1, @, Problem XXXIX.XIII]), but this is solved by 
special argument reducing this equation to a succession of two quadratic 
equations (the equation implies (x(x + l)) 2 = x(x + 1) + 1 so x(x + 1) is the 
golden ratio (\/5 + l)/2). 

Note that Cardano [3j does not use the general reduction (|3.2p to eliminate 
the cubic term (in analogy with his treatment of cubic equations), which, 
together with Ferrari's method, would have given the solution of all types 
of quartic equations. I do not know whether this reduction, and thus the 
solution to general quartics, was found by Cardano, Ferrari or someone else. 

Appendix C. del Ferro's solution of the cubic equation 

Of course, del Ferro, Tartaglia and Cardano did not know Galois theory 
when they found the solution in Theorem 12.51 Their method is more direct, 
and consists in observing (by a stroke of genius) that if y = u + v, then 

y 3 = (u + v) 3 = u 3 + v 3 + 3uv(u + v) = u 3 + v 3 + 3uvy; (C.l) 

hence, if we can find two numbers u and v such that 

u 3 + v 3 = -q (C.2) 

3uv = —p, (C3) 
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then y 3 = — py — q, so y is a root of g(y) = 0. Note that (lC.2j) - (lC.3[) are the 
same as (12. 27ft and (|2.34p . To find u and v, we multiply (|C.2j) by u 3 and 
substitute fjC .3|) . yielding 

u 6 + qu 3 + (-p/3) 3 = 0. (C.4) 

This is a quadratic equation in u 3 , which is readily solved and yields fj2.32|) : 
then u is found by taking the cube root and v is found from (|C.3|) . We see 
that this yields the same u and v as the argument in Section[2j (In particular, 
(|2.33p holds, which shows that choosing the other root in (IC,4j) just means 
interchanging u and v, which does not change the root u + v, this should 
be no surprise, since u and v have identical roles in the ansatz y = u + v.) 
Consequently, this straightforward method yields the same solution u + v 
as given in (|2.24p and (|2.35|) , and we obtain another proof of Theorem 12.51 
(To see that the three different choices of u as a cube root of u 3 really yield 
the three different roots of g(y) = 0, with correct multiplicities if there is a 
double root, is perhaps less obvious by this method. We do not give a direct 
proof since we already know from Section [2] that this indeed is the case.) 

Remark C.l. Actually, the method just described, with y = u + v, is 
Tartaglia's and Cardano's (and possibly del Ferros's) version for the equation 
y 3 = cy + d (with c, d > 0) [3, Chapter XII] which corresponds to our p < 0, 
q < 0. For the equation y 3 + cy = d, which corresponds to our p > 0, q < 0, 
del Ferro, Tartaglia and Cardano instead set y = u — v [3|, Chapter XI], 
using 

y 3 = (u - v) 3 = u 3 -v 3 - 3uv(u - v) = u 3 - v 3 - 3uvy, (C.5) 

and then find u and v such that u 3 — v 3 = —q and 3uv = p. This just means 
changing the sign of v in the equations above, which of course yields the 
same final result. (But it keeps u and v positive in both cases.) 

The third case without second degree term, y 3 + d = cy is reduced by 
Cardano to the case y 3 = cy+d [3j, Chapter XIII], essentially by substituting 
— y for y, although Cardano expresses this differently. 

According to Tartaglia [la . XXXIIII pp. 42-43], he gave these rules 25 
March 1539 to Cardano in form of the following poem (English translation 
from Tartaglia versus Cardan]): 

When the cube and things together 

Are equal to some discreet number, 

Find two other numbers differing in this one. 

Then you will keep this as a habit 

That their product should always be equal 

Exactly to the cube of a third of the things. 

The remainder then general rule 

Of their cube roots subtracted 

Will be equal to your principal thing 

In the second of these acts, 
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When the cube remains alone, 
You will observe these other agreements: 
You will at once divide the number into two parts 
So that the one times the other produces clearly 
The cube of the third of the things exactly. 
Then of these two parts, as a habitual rule, 
You will take the cube roots added together, 
And this sum will be your thought. 
The third of these calculations of ours 
Is solved with the second if you take good care, 
As in their nature they are almost matched. 
These things I found, and not with sluggish steps, 
In the year one thousand five hundred, four and thirtyJl 
With foundations strong and sturdy 
In the city girdled by the sea. 
The Italian original (which rhymes in the form terza rima) is [1 
Quando chel cubo con le cose appresso 
Se agguaglia qualche numero discreto 
Trouan dui altri differ enti in esso. 

Dapoi terrai questo per consueto 
Che'llor produtto sempre sia eguale 
Alterzo cubo delle cose neto, 

El residuo poi suo generale 
Delli lor lati cubi ben sottratti 
Varra la tua cosa principale. 

In el secondo de cotestiatti 
Quando che'l cubo restasse lui solo 
Tu osseruarai quest'altri contratti, 

Del numer farai due tal part'a uolo 
Che l'una in l'altra si produca schietto 
El terzo cubo delle cose in stolo 

Delle qual poi, per communprecetto 

Torrai li lati cubi insieme gionti 

Et cotal somma sara il tuo concetto. 

El terzo poi de questi nostri conti 
Se solue col secondo se ben guardi 
Che per natura son quasi congionti. 

Questi trouai, & non con pafii tardi 
Nel mille cinquecente, quatroe trenta 
Con fondamenti ben sald'e gagliardi 



'Venice reckoned the year from 1 March, so February 1735 was still 1734 in Venice [l5| . 
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Nella citta dal mar'intorno centa. 

Remark C.2. An equivalent, and somewhat quicker, way to obtain Car- 
dano's formula is to use Viete's substitution y = u—p/ (3u) in y 3 +py+q = 0, 
which yields (|C.4|) directly. (This is obviously equivalent to setting y = u + v 
with 3uv = —p as above. See [22I Chapter 3] for Viete's version of this.) 

Appendix D. Ferrari's solution of the quartic equation 

Consider again a fourth degree polynomial g(y) = y 4 + py 2 + qy + r as 
in (|3.3|) . The solution to the equation g(y) = given in Theorem 13.21 is not 
the solution originally found by Ferrari and presented by Cardano in Ars 
Magna 0, Chapter XXXIX] (cf. Appendix [B]) . 

Ferrari's method is as follows (in a modern version). From y 4 +py 2 + qy + 
r = we obtain, for any z, 

(y 2 + z) 2 = y 4 + 2y 2 z + z 2 = (2z - p)y 2 -qy + z 2 -r. (D.l) 

We let z := (p + u)/2 and obtain, for any u, 

y h — 7T- = u v -qy^ - A r - ( D - 2 ) 



2 J a 4 

The right-hand side is a quadratic polynomial in y, and its discriminant is 

q 2 - Au ( ^ P+ ^ - = -u 3 - 2pu 2 - p 2 u + 4ru + q 2 = -R(u), (D.3) 

where R is the cubic resolvent (|3.21|) . Hence, if we choose u as a non- 
zero root of R, then the right-hand side of (jD.2|) is the square of a linear 
polynomial. More precisely, if we further let 7 = -y/u, then the right-hand 
side of (1TI2|) is 

uy 2 - qy + \- = u(y - =Uy-^-), (D.4) 



4u V 2u) V 27/ 



and thus ([Tl2]) yields 



Consequently, 



^T^(--^)- ' M > 

This yields a pair of quadratic equations in y, whose solutions are the four 
roots of g(y) = 0. (It thus suffices to choose one non-zero root of R(u) = in 
order to find all roots of g{y) = 0. See Remark ID . 21 below for a justification.) 

Remark D.l. Ferrari and Cardano considered, as said above, only equa- 
tions with positive coefficients (putting some of them on the right-hand side), 
so they used different versions of the method for different signs of our p, q 
and r, but the versions are essentially the same. 
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Moreover, in the original version, first y + py is completed to a square 
(usually, at least), yielding 

(Wf) 2 = -qy + ^-r; (D.7) 

then this is further modified by considering (y 2 + p/2 + t) 2 and choosing t 
so that the right-hand side becomes a square. This is obviously equivalent 
to the one-step completion of a square above, with z = p/2 + 1. We further 
made the substitution t = u/2 in order to obtain the same form of the cubic 
resolvent as before. 

See for a detailed study of Cardano's solutions to quartics. 

We can connect Ferrari's method and the methods in Section [3] as follows, 
using the notation in Sectional The roots Pi, 1^2 are ^71 db ^(72 + 73), and 
are thus the roots of the quadratic equation, using ()3.16p — fj3.18|) and (|3.22p 
and assuming 71 7^ 0, 

(2y - 71) 2 = (72 + 7s) 2 = v + w + 2 72 7 3 = -2p - u - 2q/ ll . (D.8) 

This equation can be rewritten, since 71 = y/u, 

% 2 - 4^/7! + 2u + 2p + 2q/ lx = 0, (D.9) 

Ay 2 + 2n + 2p = 47iy-2g/7i, (D.10) 

^ h — 5 — = Ti?/ - 75 — • ( D -H) 
2 271 

The other two roots P^,Pa are obtained by replacing 71 by —71, the other 
square root of u. 

We have thus obtained the equations (ID.6D . with 7 = 71 := \/u. 

Remark D.2. This derivation of (1D.6|) from Theorem [321 shows clearly that 
the two roots of each of the two quadratic equations in (]D.6jl together yield 
the four different roots of g[y) = 0. Typically, the four roots are distinct 
and we obtain all roots once each, but even when g has multiple roots and 
there are repetitions in the roots of (|D.6j) . we obtain the roots of g with 
correct multiplicities from (1D.6|) . 

Remark D.3. We started above with a reduced quartic y +py 2 + qy + r (as 
did Cardano and Ferrari), but, as noted by Lagrange 0,' no. 27] the method 
can also be applied directly to a general quartic f[x) = x 4 + bx 3 + cx 2 +dx + e 
by expanding (x 2 + \x + 1) 2 and using f(x) = in analogy with (jD.ip ; we 
then continue as above, obtaining a cubic resolvent equation for t, etc., see 



171 . no. 27] or (J, Section 12.1.C] for details. The resolvent equation for t 



becomes 

o c n bd - 4e (4c - b 2 )e - d 2 _ , 

t 3 1 2 H 1 + ± '- = 0. (D.12 

2 4 8 v 1 

Comparing with (ID. ID . and recalling y = x + 6/2, we have t = z + b 2 /16 = 

u/2 + p/2 + 6 2 /16 and thus u = 2t — p — b 2 /8, so the resulting cubic resovent 

equation (1IXI2"]) is R(2t - p - b 2 /8) = 0, with R given by (EOT]) . Using 
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Remark 13.51 this can be written as R(2t) = 0, as also follows from (|D.12j) 
and (|3.32p . so the roots of this resolvent equation are simply Sj/2, i.e., 
^(aiOtj + ockO>i) for permutations ijkl of 1234. 

Remark D.4. Expressed in the roots we have by Remark 13.41 

z = (u + p)/2 = (p 1 fo + p 3 p A )/2. (D.13) 

This also follows by (jD.lip . which implies fiifa = (u + p)/2 + 9/271 and, 
replacing 71 by -71, /3 3 /3 4 = (u + p)/2 - 9/271. 

Remark D.5. Ferrari's method has the following geometric interpretation 
in algebraic geometry, see @j and [l| for details. 

Let w := y 2 . Then (y, w) is a simultaneous solution of w 2 +pw + qy+r = 
and w — y 2 =0, and thus also of the linear combination 

w 2 + pw + qy + r + u(w — y 2 ) = 

for any u. As w varies, this equation defines a family (called pencil) of 
quadratic curves (also known as conies) in the (y, u^-plane. A calculation 
essentially equivalent to the argument above shows that this conic is singular, 
and thus a union of two lines, exactly when R(u) = 0, and then the two 
lines are w + (p + u)/2 = ±(72/ — 9/27) with 7 = y/u (assuming a / 0), 
corresponding to (|D.6p . Ferrari's method thus can be seen as finding one 
singular conic in the pencil and decomposing it into a pair of lines; the 
solutions then are given by the intersections between these lines and the 
conic w = y 2 . 

Remark D.6. Descartes gave in 1637 yet another method to solve quartic 
equations (see e.g. Descartes' s method is based on trying to factor 

g(y) = (y 2 + ky + l)(y 2 + my + n) by identifying the coefficients, which yields 
the equations 

k + m = 0, km + I + n = p, kn + Im = q, In = r. (D.14) 

This yields m = —k and, after some algebra, R(k 2 ) = 0, where R is the 
cubic resolvent (|3.2ip . Hence we can solve R(u) = 0, choose one root u, let 
k := 7 := y/u and m := —k (the other square root of u); solving for / and n 
then yields (for u / 0) 

(\ f 2 , . P + u q\( 2 .P + u q\ , n .,, 

g(y) = [y +72/ + —^ ^){ y ~ iy + ^2~ 2^)' ^ ' 

Consequently, we see again that g(y) = is equivalent to (|D.6p . 
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